Scalable anomaly detection in graphs
نویسندگان
چکیده
The advantage of graph-based anomaly detection is that the relationships between elements can be analyzed for structural oddities that could represent activities such as fraud, network intrusions, or suspicious associations in a social network. Traditionally, methods for discovering anomalies have ignored information about the relationships between people, e.g., who they know, or who they call. One approach to handling such data is to use a graph representation and detect normative patterns and anomalies in the graph. However, current approaches to detecting anomalies in graphs are computationally expensive and do not scale to large graphs. In this work, we describe methods for scalable graph-based anomaly detection via graph partitioning and windowing, and demonstrate its ability to efficiently detect anomalies in data represented as a graph.
منابع مشابه
Big Graph Mining: Algorithms, Anomaly Detection, and Applications
Graphs are everywhere in our lives: social networks, the World Wide Web, biological networks, and many more. The size of real-world graphs are growing at unprecedented rate, spanning millions and billions of nodes and edges. What are the patterns and anomalies in such massive graphs? How to design scalable algorithms to find them? How can we make sense of very large graphs? And what kind of rea...
متن کاملoddball: Spotting Anomalies in Weighted Graphs
Given a large, weighted graph, how can we find anomalies? Which rules should be violated, before we label a node as an anomaly? We propose the OddBall algorithm, to find such nodes. The contributions are the following: (a) we discover several new rules (power laws) in density, weights, ranks and eigenvalues that seem to govern the socalled “neighborhood sub-graphs” and we show how to use these ...
متن کاملA Scalable Approach for Outlier Detection in Edge Streams Using Sketch-based Approximations
Dynamic graphs are a powerful way to model an evolving set of objects and their ongoing interactions. A broad spectrum of systems, such as information, communication, and social, are naturally represented by dynamic graphs. Outlier (or anomaly) detection in dynamic graphs can provide unique insights into the relationships of objects and identify novel or emerging relationships. To date, outlier...
متن کاملCADDeLaG: Framework for distributed anomaly detection in large dense graph sequences
Random walk based distance measures for graphs such as commutetime distance are useful in a variety of graph algorithms, such as clustering, anomaly detection, and creating low dimensional embeddings. Since such measures hinge on the spectral decomposition of the graph, the computation becomes a bottleneck for large graphs and do not scale easily to graphs that cannot be loaded in memory. Most ...
متن کاملDELTACON: A Principled Massive-Graph Similarity Function
How much did a network change since yesterday? How different is the wiring between Bob’s brain (a lefthanded male) and Alice’s brain (a right-handed female)? Graph similarity with known node correspondence, i.e. the detection of changes in the connectivity of graphs, arises in numerous settings. In this work, we formally state the axioms and desired properties of the graph similarity functions,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Intell. Data Anal.
دوره 19 شماره
صفحات -
تاریخ انتشار 2015